Analytic Partition Function Zeros of the Wako-Saito-Muñoz-Eaton β-hairpin Model

An analytic formula for the density of states of the Wako-Saito-Muñoz-Eaton model, for simple classes
of β-hairpins, is obtained. Under certain simplifying assumptions on the structure of the turn region
and the values of the local entropy, the partition function zeros are also obtained in analytic form.
The zeros are uniformly distributed on a circle, exhibiting the first-order-like nature of the folding
transition. After introducing a hydrophobic core at the central region of the hairpin, the zeros are
shown to distribute uniformly on two concentric circles corresponding to the hydrophobic collapse
and the transition to the fully folded conformations. The dependence of the distribution of the
zeros on the position of the hydrophobic core is shown to have a clear physical interpretation. The
exact partition function zeros for a hairpin with a more complex structure of native contacts, the
16 C-terminal residues of streptococcal protein G B1, are also computed numerically, and their loci
are also shown to be closely approximated by concentric circles.

PACS numbers: 87.15.ad, 87.15.hp, 87.15.Cc, 64.60.De 



Theoretical studies of protein folding have often been performed with simple
models incorporating information on the native structure [1-12]. The
Wako-Saito-Muñoz-Eaton (WSME) model is one such example, described by
Ising-like binary variables with long-range interactions on a one-dimensional
lattice. The transfer matrix formalism was developed so that the exact
partition function can be computed for any given temperature [9]. Since the
partition function contains all the information on thermodynamics, various
quantities relevant to the conformational transition of a protein can be
calculated. However, analysis of partition function zeros [13-25] in the
complex temperature plane, one of the most powerful tools for studying phase
transitions, has never been performed for the WSME model. The native
structure has to be specified to define the Hamiltonian in the WSME model,
and we concentrate on β-hairpins in this Letter. Although a β-hairpin is a
very simple structure, it captures nontrivial aspects of protein folding
because contacts are formed between residues far apart in sequence.
Therefore, β-hairpins have been the subject of extensive research, both
experimental and computational [2-4, 26-33], including studies using the
WSME model. Remarkably, we find that under certain simplifying assumptions
on the values of the native interactions, not only the partition function but
also the partition function zeros themselves can be obtained analytically.
The exact partition function zeros for more complicated cases, which can
still be obtained numerically, show qualitative behavior similar to that of
the analytic solution.

The WSME model describes a peptide or protein of length $N + 1$ by Ising-type
variables $m_i$ ($i = 1, \dots, N$), where $m_i$ denotes the state of the
$i$-th peptide bond connecting the $i$-th and $(i+1)$-th residues. The
variable $m_i$ takes the value 1 if the bond is in the ordered state and 0 if
it is in the disordered state. If the entropy of the ordered bond relative to
the disordered one (in units of $k_B$) is denoted as $\Delta s_i < 0$, then
$\lambda_i = \exp(-\Delta s_i) > 0$ can be considered as the effective number
of microstates of a disordered bond. For a β-hairpin, we assume that the
local entropy cost for ordering a bond is the same throughout the protein
chain, writing the effective number of disordered bond states as
$\lambda = \exp(-\Delta s)$. The number of conformations of an ordered bond
is 1 by definition. Note that $\lambda$ does not have to be an integer in
general. The Hamiltonian of the WSME model is

$$ H(\{m_k\}) = \sum_{i=1}^{N-1} \sum_{j=i+1}^{N} \epsilon_{ij} \Delta_{ij} \prod_{k=i}^{j} m_k, \tag{1} $$

where $\epsilon_{ij}$ is the contact energy between the $i$-th and $j$-th
bonds, and $\Delta_{ij}$ is 1 only if the bonds are in contact in the native
structure and 0 otherwise. Thus, the contact energy is assigned if and only
if the corresponding pair of bonds is in contact in the native structure and
all the bonds in the stretch of sequence from $i$ to $j$ are in the ordered
state. The contact energy can represent either a backbone hydrogen bond or a
hydrophobic interaction between side chains.
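
As an illustration of the Hamiltonian, Eq. (1), the following minimal Python sketch evaluates the energy of a given bond configuration. The function name `wsme_energy`, the dictionary layout of the contact map, and the 4-bond example with contact energy -1 are assumptions introduced here for illustration only; they are not taken from the Letter.

```python
def wsme_energy(m, contacts):
    """Energy of a WSME configuration, following Eq. (1).

    m        : sequence of 0/1 bond states (1 = ordered); m[0] is bond 1.
    contacts : dict mapping a native contact (i, j), 1-indexed with i < j,
               to its contact energy (hypothetical layout for this sketch).
    """
    energy = 0.0
    for (i, j), eps in contacts.items():
        # A contact contributes only if every bond from i to j is ordered.
        if all(m[k - 1] == 1 for k in range(i, j + 1)):
            energy += eps
    return energy


# Hypothetical 4-bond hairpin: bond i pairs with bond N - i + 1.
contacts = {(1, 4): -1.0, (2, 3): -1.0}
fully_folded = wsme_energy((1, 1, 1, 1), contacts)  # both contacts formed
tip_unfolded = wsme_energy((0, 1, 1, 1), contacts)  # only the (2, 3) contact
turn_broken = wsme_energy((1, 0, 1, 1), contacts)   # no contact can form
```

In the last configuration a single disordered bond inside the stretch removes both contacts, which is the long-range character of the interaction.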

We concentrate on simplified classes of β-hairpins where the $i$-th bond
forms a native contact only with the $(N-i+1)$-th bond, the one at the
opposite side of the hairpin. The native structures of these hairpins for
even and odd values of $N$ are displayed in Fig. 1(a) and (b), where the
contacts are denoted by thin lines. The model with odd $N$ [Fig. 1(b)] is
more realistic in that there is an additional bond in the turn region, but
the model with even $N$ [Fig. 1(a)] has the advantage that an analytic
formula exists for the partition function zeros themselves, in the limit of
large $\lambda$. These models have the simplifying characteristic that the
lines of contacts do not cross each other. When they do, as in Fig. 1(c),
the analytic formula for the density of states becomes more complicated and
consequently less useful. However, one can easily compute the exact density
of states and partition function zeros for a given set of parameters, using
a transfer matrix. We will restrict ourselves to an even value of $N$
without lines of native contacts crossing each other [Fig. 1(a)], unless
stated otherwise.

FIG. 1: Native conformations of simple models of β-hairpins,
where the number of bonds $N$ is (a) even and (b) odd. The
density of states can be obtained analytically, and the zeros
themselves can also be obtained in analytic form for an even
value of $N$ in the limit of large $\lambda$. (c) An example of a more
complex hairpin model with the lines of native contacts crossing
each other. The exact density of states can be computed
numerically using a transfer matrix.

Let us call the contact between the $i$-th and $(N-i+1)$-th bonds the $i$-th
contact, and rewrite the corresponding energy as
$\epsilon_i \equiv \epsilon_{i,N-i+1}$ ($i = 1, \dots, N/2$) for simplicity
of notation. It is clear that the broken native contacts can appear only as
a sequential stretch in the tip region, due to the restriction that a native
contact can form only when all the intervening bonds are ordered. Let us
suppose that the $i$-th native contacts with $i \le j$ are all broken and
the rest are intact. The corresponding energy value is

$$ E_j = \sum_{i=j+1}^{N/2} \epsilon_i = E_N - \sum_{i=1}^{j} \epsilon_i \quad (0 \le j \le N/2), \tag{2} $$



where $E_N \equiv \sum_{i=1}^{N/2} \epsilon_i$ is the energy value of the
fully folded conformation. Since at least one of the bonds forming the
$j$-th contact has to be disordered, they cannot both be in the ordered
state. Therefore, the total number of states this pair can be in is
$(\lambda+1)^2 - 1$. All the other bonds with broken native contacts can be
in any of the $\lambda + 1$ states, whereas those forming the native
contacts are in the ordered state, whose number is 1 by definition. The
total number of conformations for a given value of $j$ is thus obtained by
multiplying these numbers of bond states:

$$ \Omega(E_j) = \begin{cases} 1 & (j = 0) \\ \left((\lambda+1)^2 - 1\right)(\lambda+1)^{2j-2} & (1 \le j \le N/2) \end{cases} \tag{3} $$

where $j = 0$ corresponds to the fully folded conformation.

If all the native contacts are due to hydrogen bonds, we may assign an equal
energy value $\epsilon_i = \epsilon < 0$ to each contact, and the partition
function for an even value of $N$ is obtained in analytic form from Eq. (3)
as a function of $z \equiv e^{\beta\epsilon}$, with $\beta = 1/(k_B T)$:

$$ Z = z^{-N/2} \frac{\lambda^2+2\lambda}{(\lambda+1)^2} \left[ \frac{(\lambda+1)^2}{\lambda^2+2\lambda} + \sum_{j=1}^{N/2} \left((\lambda+1)^2 z\right)^j \right]. \tag{4} $$

When $\lambda$ is large enough so that

$$ \frac{\lambda^2+2\lambda+1}{\lambda^2+2\lambda} \simeq 1, \tag{5} $$

we may approximate the partition function as

$$ Z \simeq z^{-N/2} \frac{\lambda^2+2\lambda}{(\lambda+1)^2} \sum_{j=0}^{N/2} \left((\lambda+1)^2 z\right)^j, \tag{6} $$

so that the solution to the equation $Z(z) = 0$ is obtained analytically as

$$ z_j = \frac{1}{(\lambda+1)^2} \exp\left(\frac{2\pi i j}{N/2+1}\right) \quad (j = 1, \dots, N/2). \tag{7} $$



That is, the zeros are uniformly distributed along the circle of radius
$1/(\lambda+1)^2$, except for the point on the positive real axis. Since the
physical region is $0 < z < 1$, we see that the folding transition exists as
long as $\lambda > 0$, according to Eq. (7), with the corresponding folding
temperature at $T_f = -\epsilon / [2 k_B \ln(\lambda+1)]$. The folding
temperature decreases as $\lambda$ increases, since the unfolded
conformation becomes more favored entropically. As is well known, the
uniform distribution of zeros on a circle leads to a first-order transition
in the limit of infinite $N$ [13].
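
The state counting behind Eq. (3), and the closed form Eq. (4) built on it, can be checked by brute-force enumeration for a short chain. The sketch below is an illustration only: the function names and the choice $N = 6$, $\lambda = 2$, $z = 0.05$ are assumptions, and $\lambda$ is taken to be an integer so that the counts are exact.

```python
from itertools import product


def enumerate_density_of_states(N, lam):
    """Weighted count of conformations by number of broken contacts (even N).

    Each disordered bond (m = 0) carries weight lam, each ordered bond weight 1.
    Returns omega with omega[j] = number of states with j broken contacts.
    """
    n = N // 2
    omega = [0] * (n + 1)
    for m in product((0, 1), repeat=N):
        weight = lam ** m.count(0)
        # Contact i joins bonds i and N-i+1 and needs bonds i..N-i+1 all ordered.
        broken = sum(1 for i in range(1, n + 1) if not all(m[i - 1:N - i + 1]))
        omega[broken] += weight
    return omega


def analytic_density_of_states(N, lam):
    """Eq. (3): 1 for j = 0, ((lam+1)^2 - 1)(lam+1)^(2j-2) for 1 <= j <= N/2."""
    n = N // 2
    return [1] + [((lam + 1) ** 2 - 1) * (lam + 1) ** (2 * j - 2)
                  for j in range(1, n + 1)]


def partition_function_from_counts(omega, z):
    """Z = z^(-N/2) * sum_j omega[j] z^j for equal contact energies."""
    n = len(omega) - 1
    return z ** (-n) * sum(w * z ** j for j, w in enumerate(omega))


def partition_function_eq4(N, lam, z):
    """Closed form of Eq. (4)."""
    n = N // 2
    prefactor = z ** (-n) * (lam ** 2 + 2 * lam) / (lam + 1) ** 2
    bracket = (lam + 1) ** 2 / (lam ** 2 + 2 * lam)
    bracket += sum(((lam + 1) ** 2 * z) ** j for j in range(1, n + 1))
    return prefactor * bracket


counted = enumerate_density_of_states(6, 2)
formula = analytic_density_of_states(6, 2)
z_from_counts = partition_function_from_counts(counted, 0.05)
z_from_eq4 = partition_function_eq4(6, 2, 0.05)
```

The counts add up to $(\lambda+1)^N$, the total number of weighted bond configurations, which is a simple consistency check on the enumeration.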

Although the approximation Eq. (5) is better for larger $\lambda$, the
locations of the zeros fit quite well with the analytic solution Eq. (7)
also for small values of $\lambda$, even for the extreme case $\lambda = 1$
($\Delta s = 0$), as can be seen in Fig. 2 [34]. In the figure, the
partition function zeros for $N = 14$, obtained by solving the polynomial
equation $Z(z) = 0$ with MATHEMATICA, are plotted on the complex plane of
$(\lambda+1)^2 z$ for several integer values of $\lambda$. Straight lines
through the origin at angular intervals of $\pi/4$ are also drawn along with
the unit circle, so that their intersections with the circle are the
analytic solutions Eq. (7), except for the one on the positive real axis.
We see that the distances of the zeros from the origin are slightly larger
than the radius of the circle, because the entropic cost of the fully folded
state is overestimated in the approximation Eq. (5). Note, however, that the
deviation is visible only near the negative real axis, and is almost
negligible near the positive real axis, which is the physically meaningful
region.
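
The size of the term neglected in Eq. (5) can be made explicit by evaluating both brackets at the analytic zeros of Eq. (7). The following sketch, with function names chosen here for illustration, shows that the sum in Eq. (6) vanishes there, while the bracket of Eq. (4) is left with the constant $1/(\lambda^2+2\lambda)$, which shrinks as $\lambda$ grows. The parameters $N = 14$, $\lambda = 2$ follow one of the cases discussed for Fig. 2.

```python
import cmath
import math


def analytic_zeros(N, lam):
    """Eq. (7): zeros on the circle of radius 1/(lam+1)^2, j = 1..N/2."""
    n = N // 2
    radius = 1.0 / (lam + 1) ** 2
    return [radius * cmath.exp(2j * math.pi * k / (n + 1))
            for k in range(1, n + 1)]


def approx_bracket(z, N, lam):
    """Sum in Eq. (6): sum_{j=0}^{N/2} ((lam+1)^2 z)^j."""
    w = (lam + 1) ** 2 * z
    return sum(w ** j for j in range(N // 2 + 1))


def exact_bracket(z, N, lam):
    """Bracket in Eq. (4), which keeps the exact weight of the folded state."""
    w = (lam + 1) ** 2 * z
    first = (lam + 1) ** 2 / (lam ** 2 + 2 * lam)
    return first + sum(w ** j for j in range(1, N // 2 + 1))


N, lam = 14, 2
zeros = analytic_zeros(N, lam)
approx_residuals = [abs(approx_bracket(z, N, lam)) for z in zeros]
exact_residuals = [abs(exact_bracket(z, N, lam)) for z in zeros]
neglected_term = 1.0 / (lam ** 2 + 2 * lam)
```

The exact zeros themselves still require a numerical root finder; this sketch only quantifies how far the exact polynomial is from vanishing at the analytic positions.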

FIG. 2: The partition function zeros of the WSME model of β-hairpin
models with 7 native contacts and uniform interaction strengths,
in the complex plane of $(\lambda+1)^2 z$. The analytic
solution Eq. (7) lies on the intersections of the circle and the
straight lines, which fits quite well with the numerical solutions
for even chain length $N = 14$. The zeros for odd chain
length $N = 15$ are also plotted.

FIG. 3: Intermediate conformation where a hydrophobic core
(double line) is formed but the tip region is unfolded. The
dashed lines denote broken native contacts. (a) When a hydrophobic
core exists near the tip of the hairpin, it can be
formed only in the fully folded conformation. (b) When the
hydrophobic core is near the turn, the intermediate conformation
is almost indistinguishable from the unfolded conformation
in terms of energy and entropy. (c) Nontrivial folding intermediates
can be formed when the hydrophobic core is located
near the middle of the hairpin.

For odd values of $N$, the density of states is almost the same as that of
even $N$ with the same number of native contacts, except that the number of
fully unfolded states is larger,

$$ \Omega(E_{(N-1)/2}) = \left((\lambda+1)^3 - 1\right)(\lambda+1)^{N-3}, \tag{8} $$

due to the contribution of the extra bond at the turn, which is liberated.
The zeros for $N = 15$, corresponding to the same number of native contacts
as $N = 14$, are also plotted in Fig. 2. As can be seen from the figure,
they are located inside the circle, because the fully unfolded structure is
now more favorable compared to that of even $N$, owing to the additional
entropic contribution from the extra bond at the turn. The deviation from
the even-length chains increases as $\lambda$ increases, as is to be
expected.
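
The count of fully unfolded states for odd $N$, Eq. (8), can be verified by direct enumeration on a short chain. The sketch below is illustrative only; the function names and the choice $N = 5$, $\lambda = 2$ are assumptions, with an integer $\lambda$ so that the counts are exact. For comparison it also evaluates the even-$N$ count for the same number of contacts, $((\lambda+1)^2 - 1)(\lambda+1)^{N-2}$ with $N = 4$.

```python
from itertools import product


def unfolded_count_odd(N, lam):
    """Weighted number of fully unfolded states for odd N, by enumeration."""
    n = (N - 1) // 2  # number of native contacts
    total = 0
    for m in product((0, 1), repeat=N):
        # Fully unfolded: the innermost contact n (bonds n..N-n+1, which
        # include the extra turn bond) is broken; all outer contacts then
        # are broken as well.
        if not all(m[n - 1:N - n + 1]):
            total += lam ** m.count(0)
    return total


def unfolded_count_formula(N, lam):
    """Eq. (8): ((lam+1)^3 - 1) (lam+1)^(N-3)."""
    return ((lam + 1) ** 3 - 1) * (lam + 1) ** (N - 3)


lam = 2
counted_odd = unfolded_count_odd(5, lam)
formula_odd = unfolded_count_formula(5, lam)
# Even chain with the same two native contacts (N = 4).
formula_even = ((lam + 1) ** 2 - 1) * (lam + 1) ** (4 - 2)
```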

By introducing a hydrophobic interaction in addition to the hydrogen bonds,
we can observe the collapse transition to an intermediate where the
hydrophobic core is formed but the tip region is unfolded [Fig. 3(c)].
Again, the partition function zeros can be obtained analytically for a
special case. Consider the case where $N$ is even, $n \equiv N/2$ is odd,
and the hydrophobic core consists of a single additional interaction at the
$h$-th contact, $h \equiv (n+1)/2$, with (free) energy
$\Delta G_{sc} = q\epsilon < 0$ for some integer $q$. Also, we assume that
each hydrogen bond contributes the energy $\Delta H_{hb} = p\epsilon < 0$,
where $p$ is also an integer. The partition function can then be factorized
as

$$ Z = z^{-(np+q)} \frac{\lambda^2+2\lambda}{(\lambda+1)^2} \left[ \frac{\lambda^2+2\lambda+1}{\lambda^2+2\lambda} + \sum_{j=1}^{h-1} (\lambda+1)^{2j} z^{pj} + \sum_{j=h}^{n} (\lambda+1)^{2j} z^{pj+q} \right] $$
$$ \simeq z^{-(np+q)} \frac{\lambda^2+2\lambda}{(\lambda+1)^2} \left[ \sum_{j=0}^{h-1} \left((\lambda+1)^2 z^p\right)^j \right] \times \left[ 1 + (\lambda+1)^{2h} z^{hp+q} \right], \tag{9} $$

where the approximation (5) is used as before. The first and the second
factors give two concentric circles for the zeros:

$$ z_j = \frac{1}{(\lambda+1)^{2/p}} \exp\left(\frac{2\pi i j}{hp}\right) \quad (j = 1, \dots, hp-1), $$
$$ z_j = \frac{1}{(\lambda+1)^{2h/(hp+q)}} \exp\left(\frac{(2j+1)\pi i}{hp+q}\right) \quad (j = 0, \dots, hp+q-1), \tag{10} $$

again showing the first-order-like nature of the transition. It is easy to
see from the analytic solution of the loci (10) that the folding and
collapse transitions occur at $T_f = -\epsilon p / [2 k_B \ln(\lambda+1)]$
and $T_c = -\epsilon (hp+q) / [2 k_B h \ln(\lambda+1)]$.
As to be expected, larger value of AH^ corresponds 
to higher value of T c and denser distribution of zeros 
on the outer locus signifying sharper collapse transition. 
Also, for $q = 0$ the two circles collapse to one circle corresponding to
the folding transition, Eq. (7), by setting $p = 1$ without loss of
generality. The exact partition function zeros for $n = 9$, $p = 1$,
$q = 2$, and $\lambda = 2$, with an extra hydrophobic interaction at the
$k$-th contact, are plotted on the complex $z$-plane in Fig. 4, along with
the circles at radii $1/(\lambda+1)^{2/p} = 1/9 \simeq 0.111$ and
$1/(\lambda+1)^{2h/(hp+q)} = 1/9^{5/7} \simeq 0.208$. We see that the zeros
for $k = 5$ are extremely well described by the analytic solution (10),
being distributed on the inner and outer circles at angular intervals of
$2\pi/(hp) = 2\pi/5$ and $2\pi/(hp+q) = 2\pi/7$, respectively. We see that
as the position of the hydrophobic core is moved toward the tip, the radius
of the outer locus decreases, since the intermediate becomes entropically
unfavorable. Also, the density of zeros on the inner locus decreases,
because the intermediate and the fully folded conformation become less
distinguishable [Fig. 3(a)]. Eventually, at $k = 1$ the zeros form one locus
corresponding to the folding transition. On the other hand, as the
hydrophobic core moves toward the turn, the radius of the outer locus
increases and its density decreases, because the entropy of the
intermediate increases and it becomes less distinguishable from the
unfolded state [Fig. 3(b)].

FIG. 4: Partition function zeros of the WSME model of β-hairpin
models with a hydrophobic core. There are three additional
zeros for $k = 9$ outside the range of the plot, at $z = -0.45$
and $z = 0.28 \pm 0.41i$.
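
The two-circle structure of Eq. (10) can be checked numerically against the approximated sum in Eq. (9) for the parameters quoted for Fig. 4 ($n = 9$, $p = 1$, $q = 2$, $\lambda = 2$, core at the central contact). The sketch below is an illustration with function names chosen here; it assumes $n = 2h - 1$. Indices $j$ that are multiples of $h$ are skipped on the inner circle, since there $(\lambda+1)^2 z^p = 1$ and the first factor does not vanish; for $p = 1$ no such index occurs in the range.

```python
import cmath
import math


def inner_zeros(lam, h, p):
    """Zeros of the first factor of Eq. (9), on radius (lam+1)^(-2/p)."""
    radius = (lam + 1) ** (-2.0 / p)
    return [radius * cmath.exp(2j * math.pi * k / (h * p))
            for k in range(1, h * p) if k % h != 0]


def outer_zeros(lam, h, p, q):
    """Zeros of the second factor, on radius (lam+1)^(-2h/(hp+q))."""
    radius = (lam + 1) ** (-2.0 * h / (h * p + q))
    return [radius * cmath.exp(1j * math.pi * (2 * k + 1) / (h * p + q))
            for k in range(h * p + q)]


def approx_sum(z, lam, n, h, p, q):
    """Bracket of Eq. (9) under approximation (5), before factorization."""
    first = sum((lam + 1) ** (2 * j) * z ** (p * j) for j in range(h))
    second = sum((lam + 1) ** (2 * j) * z ** (p * j + q)
                 for j in range(h, n + 1))
    return first + second


lam, n, p, q = 2, 9, 1, 2
h = (n + 1) // 2
z_inner = inner_zeros(lam, h, p)
z_outer = outer_zeros(lam, h, p, q)
r_inner, r_outer = abs(z_inner[0]), abs(z_outer[0])
max_residual = max(abs(approx_sum(z, lam, n, h, p, q))
                   for z in z_inner + z_outer)
```

The 4 inner and 7 outer zeros together account for all 11 roots of the approximated polynomial of degree $np + q$.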



So far we have concentrated on a general class of simplified hairpin models.
A real β-hairpin, the 16 C-terminal residues of streptococcal protein G B1
(GB1), which includes crossed lines of native contacts [Fig. 1(c)], has also
been studied with the WSME model. The density of states was calculated using
the transfer matrix formalism, with the native contacts taken from the
literature. The hydrogen bond and hydrophobic interaction energies are
$\Delta H_{hb} = -1.1$ kcal/mol and $\Delta G_{sc} = -2.0$ kcal/mol, and the
local entropic cost of folding is $\Delta s = -3.12$ cal/(K mol), which
corresponds to $p = 11$, $q = 20$ with $\epsilon = -0.1$ kcal/mol, and
$\lambda = 4.80$. The zeros are obtained as solutions of a 137th-order
polynomial equation, and are plotted in the plane of $e^{\beta\epsilon}$ in
Fig. 5. We see that the loci of zeros also form concentric circles, with
radii 0.727 and 0.845 corresponding to temperatures $T = 158$ K and
$T = 299$ K. Note, however, that the angular distribution on the inner
circle is not uniform. The transition temperature $T = 299$ K is quite
consistent with the value of 297 K defined in terms of kinetic rates and
free energy in previous studies. A two-state transition behavior at this
temperature was reported experimentally and also confirmed by a theoretical
study using the WSME model [9]. In fact, since the distribution of the zeros
on the inner locus is very sparse, especially near the positive real axis,
any possible transition from the hydrophobically collapsed intermediate to
the fully folded state near $T \simeq 158$ K might be quite smooth, making
it hard to observe as a meaningful conformational transition.

FIG. 5: The partition function zeros of the WSME model of a β-hairpin,
the 16 C-terminal residues of streptococcal protein G B1.
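
The conversion between the radii of the loci in the $e^{\beta\epsilon}$ plane and the quoted temperatures is simple arithmetic, sketched below. The function names and the value of the gas constant, $R = 1.987 \times 10^{-3}$ kcal/(mol K), are assumptions of this sketch; with that value the entropy $\Delta s = -3.12$ cal/(K mol) gives $\lambda \approx 4.8$.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol K); assumed value for this sketch


def temperature_from_radius(radius, eps):
    """Solve radius = exp(eps / (R T)) for T, with eps in kcal/mol."""
    return eps / (R_KCAL * math.log(radius))


def lambda_from_entropy(delta_s):
    """lambda = exp(-delta_s / R), with delta_s in cal/(K mol)."""
    return math.exp(-delta_s / (R_KCAL * 1000.0))


eps = -0.1  # energy unit in kcal/mol
t_inner = temperature_from_radius(0.727, eps)  # inner circle
t_outer = temperature_from_radius(0.845, eps)  # outer circle
lam = lambda_from_entropy(-3.12)
p = round(-1.1 / eps)  # hydrogen bond energy in units of eps
q = round(-2.0 / eps)  # hydrophobic energy in units of eps
```

Rounded to the nearest kelvin, the two radii reproduce the temperatures 158 K and 299 K quoted in the text.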

Partition function zeros of protein models with a more complex structure of
inter-residue contacts can also be studied using the transfer matrix
formalism, which is left for future study.

The author thanks Seung-Yeon Kim and Jae Hwan Lee
for useful discussions.